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Abstract 

Bifurcations can cause dynamical systems with slowly varying parameters to transi- 
tion to far-away attractors. The terms "critical transition" or "tipping point" have been 
used to describe this situation. Critical transitions have been observed in an astonishingly 
diverse set of applications from ecosystems and climate change to medicine and finance. 
The main goal of this paper is to give an overview which standard mathematical theories 
can be applied to critical transitions. We shall focus on early-warning signs that have 
been suggested to predict critical transitions and point out what mathematical theory 
can provide in this context. Starting from classical bifurcation theory and incorporating 
multiple time scale dynamics one can give a detailed analysis of local bifurcations that 
induce critical transitions. We suggest that the mathematical theory of fast-slow systems 
provides a natural definition of critical transitions. Since noise often plays a crucial role 
near critical transitions the next step is to consider stochastic fast-slow systems. The 
interplay between sample path techniques, partial differential equations and random dy- 
namical systems is highlighted. Each viewpoint provides potential early-warning signs for 
critical transitions. Since increasing variance has been suggested as an early-warning sign 
we examine it in the context of normal forms analytically, numerically and geometrically; 
we also consider autocorrelation numerically. Hence we demonstrate the applicability of 
early-warning signs for generic models. We end with suggestions for future directions of 
the theory. 

Keywords: Critical transition, tipping point, multiple time scales, bifurcation delay, stochas- 
tic dynamics, Fokker-Planck equation, noise-induced transitions. 

1 Introduction 

In this paper "critical transitions" or "tipping points" are viewed from the perspective of dy- 
namical systems. Our aim is to point out that various observations, assumptions and ideas 
developed in diverse scientific disciplines can be expressed naturally using standard mathemat- 
ical theory. In particular, we hope that this paper can be viewed as a mathematical complement 
to the excellent review by Scheffer et al [75]. A non-mathematical working definition of a critical 
transition is an abrupt change in a dynamical system. To illustrate the concept we list four 
examples 
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• In ecosystems rapid changes to desertification or extinctions of species can occur [76j [77] . 

• Medical conditions can quickly change from regular to irregular behavior; examples are 
asthma attacks [SZ] or epileptic seizures [BT] . 

• Financial markets can transition from a balanced market to a financial crisis [60J. 

• Changes in the climate and its constituent subsystems can occur abruptly [TtH I5TI IT]. 

It is clear that we would like to understand and predict these phenomena. At first glance it 
might be surprising that all four examples have anything in common as they arise in completely 
different contexts and situations. Nevertheless, it has become apparent that critical transitions 
share several attributes [TjJ [75] : 

(1) An abrupt qualitative change in the dynamical system occurs. 

(2) The change occurs rapidly in comparison to the regular dynamics. 

(3) The system crosses a special threshold near a transition. 

(4) The new state of the system is far away from its previous state. 

Furthermore, significant progress has been made in predicting a critical transition before it 
occurs. The goal is to infer from previous data when a catastrophic shift in the dynamics is 
going to occur. Ideally we would like to have a comprehensive list of early-warning signs. A 
variety of system-specific criteria could be introduced; but we are more interested in generic 
indicators that are expected to be applicable to large classes of transitions. The following 
assumption will be of major importance [75] 

(5) There is small noise in the system i.e. the data has a major deterministic component with 
small "random fluctuations" . 

There are several characteristics that have been observed in systems before critical transi- 
tions. We shall only list a few of them here: 

(6) The system recovers slowly from perturbations ("slowing down"). 

(7) The variance of the system increases as the transition is approached. 

(8) The noisy fluctuations become more asymmetric. 

(9) The autocorrelation increases before a transition. 

Figure [TJ shows time series with critical transitions; the times series have been generated by 
simulating two generic models discussed in Sections WM for the fast-slow fold and transcritical 
bifurcations. Many natural questions arise regarding analysis and comparison of these two 
time series. The observations (l)-(9) are extremely important for this purpose. However, it is 
desirable to embed these observations into a mathematically precise description of the system 
dynamics and to identify them in generic models. Relations to bifurcation theory and some in- 
dicators have been partially analyzed using statistical techniques such as autoregressive models 
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Figure 1: Two time series with critical transitions. The red dashed vertical lines have been 
added to indicate where a clear visual change in the time series behavior appears; both series 
have been generated using fast-slow stochastic dynamical systems: (a) fold and (b) transcritical. 
The time series for the generic models we propose resemble time series from experiments. 

The major goal of the current paper is to argue that the observations (l)-(9), that are 
usually made in applications, should be understood from a mathematical viewpoint by using 
deterministic and stochastic multiscale dynamical systems. We shall try to provide an overview 
which mathematical concepts and tools can be used to develop a theory of critical transitions. 
The natural starting point is bifurcation theory [3"lj I5T)] and in this respect our approach is 
closest to recent work by Sieber and Thompson [83j El] that realizes the need for a detailed 
bifurcation-theoretic analysis of tipping points. Given a dynamical system, such as a differential 
equation or iterated map, bifurcation theory can be used to classify qualitative transitions un- 
der the variation of parameters. It has been successfully applied in fields ranging from physics, 
engineering and chemistry to modern developments in neuroscience and mathematical biology 
[80J. Here we shall focus on differential equations to simplify the discussion but remark that 
discrete time systems can also be studied from this perspective. As a second step we introduce 
stochasticity into the dynamics. Our approach is closest to the work by Berglund and Gentz 
[T9j H3] ; they demonstrated the applicability of stochastic multiscale differential equations in 
a variety of contexts such as climate modeling and neuroscience. Here we point out what 
their approach implies for critical transitions. We also use numerical simulation of "normal- 
form" -type models and compare our results to theoretic results obtained from Fokker-Planck 
equations. Our numerical approach can also provide benchmark data for time series analysis 
methods [551 HE]. 

The structure of the paper is as follows. In Section [2] we explain why the differential 
equations describing critical transitions should have multiple time scales. The focus will be on 
two scales described by a fast-slow system. We suggest that fast-slow systems theory provides 
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a natural definition for a critical transition and check that certain bifurcations satisfy this 
definition. Furthermore we review the basic calculation for "slowing down" and point that 
slowing down differs for different types of critical transitions. In Section [3] we review the 
concept of normal hyperbolicity that separates regular fast-slow system dynamics from dynamic 
bifurcations; the application in the context of critical transitions is indicated. In Section H] we 
recall some basic tools and viewpoints in stochastic analysis. Stochastic fast-slow systems 
are introduced and some recent results are stated. The case of a fast-slow stochastic system 
away from a critical transition is defined and its properties are investigated; this provides a 
comparative method to detect transitions and also to estimate system parameters. In Section 
[5] we review recent progress in stochastic bifurcation theory and show how this theory should 
apply to and interact with a critical transition involving noise. The concept of stochastic 
bifurcation is much less developed. However, we introduce an example that shows that so-called 
P-bifurcations can be early-warning signs. In Section E] noise-induced phenomena are discussed 
that have recently been discovered in many mathematical models. The prediction of critical 
transitions is more complicated in this context and a numerical example illustrates this point. 
In Section [7] we consider critical transitions under the simplest mathematical assumptions. We 
calculate the variance of distributions of trajectories near a stochastic critical transition point 
using a Fokker-Planck approach. In Section [S] we use numerical simulations to expand on our 
modeling approach and we discuss the variance as an indicator if only a single sample path is 
available. Section [9] extends the numerical simulation approach to autocorrelation. Section [10] 
concludes the paper with a discussion of the current state of the theory and a discussion of 
topics we omitted. We also sketch some directions for future work. 



2 Fast-Slow Systems I: Critical Transitions 

We start with the deterministic theory. Our first goal is to make the term "critical transi- 
tion" mathematically more precise. Consider the parametrized family of ordinary differential 
equation: 

^=x' = f(x;y) (1) 

where x G M m are phase space variables and y G lR m represent parameters. A general statement 
that can often be found in the description of critical transitions in applications is that "a 
parameter evolves slowly until the tipping point is reached" . Therefore it is a natural approach 
to include the parameters into the original differential equation. The parametrized family (JT]) 
can be written as 

x' = f(x,y), f v 

y' = 0. {l} 

Using (J2J) it is easy to incorporate slowly varying parameters by adding a slow evolution to y 

x' = f(x,y), , 3 * 

y' = tg(x,y), 

where < e 1 is a small parameter and g is assumed to be sufficiently smooth. In many cases 
it suffices to assume that the parameter dynamics is de-coupled from phase space dynamics and 
one assumes g = 1. The ODEs ([3]) form a fast-slow system where the variables x G M. m are 
the fast variables and y G M m are the slow variables. The parameter e describes the time scale 
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separation. We point out that the "inclusion of dynamic slow parameters" is entirely standard 
and well-known in the theory of multiple time scale dynamics. 



Remark: A new introductory book to fast-slow systems is currently being written [53] . The 
book is going to include the deterministic theory as well as numerical and stochastic compo- 
nents that are relevant in Sections HUH Classical references for deterministic fast-slow systems 
are \%2\ E31 E2] ■ In the current paper we restrict ourselves to review the necessary definitions 
and concepts that can directly be applied to critical transitions. 

Equation (J3]) can be re-written by changing from the fast time scale t to the slow time scale 
t = et 



e§ = ex = f(x,y), 
V = g(x,y). 



% _ ~ _ J JZ'"C ( 4 ) 



The first step to analyze a fast-slow system is to consider the singular limit e — > 0. In the 
formulation ([3]) this yields the parametrized family (|2j) which is also known as the fast subsystem 
or layer equations. Considering the singular limit in (j3J) gives the slow subsystem or reduced 
system 

= f{x,y), , 5 v 

v = g{x,y)- 

The associated subsystem flows are naturally called the fast flow and the slow flow. Equation 
(JSJ) is a differential-algebraic equation so that the slow flow is constrained to 

C = {(x,y)ER m+n :f(x,y)=0}. 

The set C is called the critical set or the critical manifold if C is manifold. The points in C 
are equilibria for the fast subsystem (jSJ). C is normally hyperbolic at p £ IR m+ ™ if the matrix 
(D x f)(p) is hyperbolic i.e. all its eigenvalues have non-zero real parts. If all eigenvalues have 
negative/positive real parts then C is attracting/repelling at p\ if C is normally hyperbolic and 
neither attracting nor repelling we say it is of saddle-type. For a normally hyperbolic critical 
manifold the implicit function theorem gives 

C = {(x,y)ER m+n :h (y)=x} 

where ho : M. n — > W 71 satisfies f{ho(y),y) = 0. Then the slow flow can be written as 

V = g(ho(y),y). 

Fenichel's Theorem |28l|88j|85] provides a complete description of the dynamics for normally 
hyperbolic invariant manifolds. 

Theorem 2.1 (Fenichel's Theorem). Suppose S = Sq is a compact normally hyperbolic sub- 
manifold (possibly with boundary) of the critical manifold C . Then for e > sufficiently small 
there exists a locally invariant manifold S t diffeomorphic to Sq. S t has a Hausdorff distance of 
0(e) from Sq and the flow on S t converges to the slow flow as e — > 0. 

S e is called a slow manifold. Different slow manifolds S e lie at a distance 0(e~ K l e ) from each 
other and so we will often simply refer to "the" slow manifold as the choice of representative 
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is irrelevant for many asymptotic results. A normally hyperbolic critical manifold Cq has 
associated local stable and unstable manifolds 



W'{C ) = |J W s (p), and W U (C ) = |J W u (p), 
pec pec 

where W s (p) and W u (p) are the local stable and unstable manifolds of p as a hyperbolic 
equilibrium of the fast subsystem. These manifolds also persist for e > sufficiently small. In 
addition to Fenichel's Theorem there are coordinate changes that simplify a fast-slow system 
considerably near a critical manifold [28| 03] if the slow flow has no bounded invariant sets. 



Theorem 2.2 (Fenichel Normal Form). Suppose So is a compact normally hyperbolic subman- 
ifold of C with m u unstable and m s stable fast directions and that the slow flow is rectifiable on 
Sq. Then there exists a smooth invertible coordinate change (x, y) i— > (a, b, v) G M mu x M ms x R n 
so that a fast-slow system (J3j) can be written as: 

a = A(a,b,v,e)a, 

b' = T(a,b,v,e)b, (6) 
v = e(ei + H(a, b, v, e)ab), 

where A, T are matrix-valued functions. A has m u positive and T has m u negative eigenvalues, 
e\ = (1, 0, . . . , 0) T G M™ is a unit vector and H is bilinear in a, b. 

The manifold So perturbs to a slow manifold S e by Fenichel's Theorem. Then this slow 
manifold is "straightened" together with its stable and unstable manifolds that become coor- 
dinate planes |42j . Therefore we can basically assume that the fast subsystem near a normally 
hyperbolic critical manifold is linear with eigendirections aligning with the coordinates. This 
will provide the basis for our discussion of dynamical behavior far away from critical transition 
points. The next classical example illustrates the definitions and shows how normal hyperbol- 
icity can fail. 

Example 2.3. Consider a planar fast-slow system modeling a fold bifurcation with slow pa- 
rameter drift [50] : 

= y x , 

y = l. 1 ] 

The critical manifold C = {(x, y) G M 2 : y = —x 2 } is normally hyperbolic away from the 
fold bifurcation point (x, y) = (0, 0) of the fast subsystem; the point (0, 0) G C is also refered 
to as a fold point. Observe that the set C a := C R {x > 0} are attracting equilibrium points for 
the fast subsystem while points on C r := C fl {x < 0} are repelling; see Figure |5J To derive an 
expression for the slow flow one can differentiate y = —x 2 implicitly with respect to r giving 
y = —2xx which yields 

1 

X ~ ~2x' 

Note that the slow flow is not well-defined at x — 0. However, by rescaling of time r — > 2xr, 
which reverses the direction of trajectories on C r , the slow flow can be desingularized. The flow 
of ([7]) for e = can be described by combining trajectories of the fast and slow subsystems; see 
Figure [2](a). A solution starting in W s {C a ) approaches it rapidly, then it follows the slow flow 
on C and finally "jumps" at the fold bifurcation point toward x = — oo. 
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Figure 2: Illustration for ([7]). (a) Singular limit e = with a candidate trajectory 70 consisting 
of two fast and one slow segment is shown, (b) Trajectory / ~f t for ([7]) with e = 0.02 and initial 
condition (x(0),y(0)) = (1.2,-0.6). 

Recall that assumption (1) in Section [1] requires a critical transition to occur at a point when 
there is a sudden change from slow dynamics to a fast repelling segment. To make this idea 
more precise we recall one more definition from fast-slow systems. In the singular limit e = 
trajectories can be considered as concatenations of trajectory segments of the fast and slow 
subsystems. A candidate [T3J ES] is defined as a homeomorphic image 70 (t) of a real interval 
(a, b) with a < b where 

• the interval is partitioned as a = to < t\ < • • • < t m = b, 

• the image of each subinterval 7o(tj_i,tj) is a trajectory of either the fast or the slow 
subsystem, 

• and the image 7o(a, b) has an orientation that is consistent with the orientations on each 
subinterval 7 (tj_i, tj) induced by the fast and slow flows. 

Note that we can also view a candidate as a trajectory of a hybrid system. If consecu- 
tive images jo(tj-x,tj) and 70(^,^+1) are trajectories for different subsystems, i.e. there is a 
transition at tj from fast to slow or from slow to fast, then we say that 70 (tj) is a transition 
point. Using candidates and transition points we can easily give a rigorous definition of critical 
transitions. 

Definition 2.4. Let p = (x p ,y p ) G C be a point where the critical manifold C is not normally 
hyperbolic. We say that p is a critical transition if there is a candidate 70 so that 

(CI) 7o(^-_i, tj) is a normally hyperbolic attracting submanifold of C, 

(C2) p = 7o(tj) is a transition point, 

(C3) and 70(^-1,^) is oriented from ■y (tj_ 1 ) to 70 (^)- 
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From a fast-slow systems perspective a critical transition occurs at a bifurcation point 
y = y p of the fast subsystem that induces switching from a stable slow motion to a fast motion. 
Definition 12 .41 can easily be generalized to more complicated invariant sets of the fast subsystem. 
For example, if p is a point that lies on a family of fast subsystem periodic orbits we can again 
define a slow flow by averaging over the periodic orbit [511 ITS] . Then the same definition 
applies. The next steps are straightforward checks whether several classical bifurcations are 
critical transitions. From now on we shall restrict ourselves to the case n — 1 reflecting slow 
variation of one parameter; without loss of generality we can assume that the bifurcation point 
of the fast subsystem is located at (x, y) = (0, 0) and we assume that g = 1 near the origin to 
simplify the exposition (in principle we will only need that g(x, y) > K > near the origin for 
some constant K independent of (x,y)). 

Proposition 2.5. Suppose m = 1 so that ([1]) is 1 -dimensional and that there is a generic fold 
(or saddle-node) bifurcation at y = 0. Then the fold bifurcation is also a critical transition. 

Proof. Near a generic fold bifurcation the flow is topologically conjugate to the normal form 
[51] of a fold bifurcation 

/ 2 

x = —y — x , 
y = e. 

The critical manifold is C = {y = —x 2 } and C a := C R {x > 0} is normally hyperbolic and 
attracting. Then the candidate 

7o = C a U{[0,-oo),x{0}} 
shows that the fold bifurcation is a critical transition. □ 

The main idea for the fold bifurcation and all the other bifurcations discussed below is 
illustrated in Figure |3j 

Proposition 2.6. Suppose m = 2 so that (TjQ) is a planar system. Suppose there is a generic 
Hopf bifurcation of the fast subsystem at y = with first Lyapunov coefficient l\ ^ 0. The Hopf 
bifurcation is a critical transition if it is subcritical (l\ > 0). If it is supercritical (l\ < 0) then 
the transition is not critical. 

Proof. By genericity of the Hopf bifurcation we can consider the normal form 

%i = yxi- x 2 + hxi(xl + x 2 ,), 

x' 2 = x 1 + yx 2 + hx 2 (x 2 1 +xl), (8) 
y> = 6. 

The equilibrium point x* = of the fast subsystem is stable for y < and loses stability at 
y = as a pair of complex conjugate eigenvalues of (D x f)(0,y) passes through the imaginary 
axis at y — 0. Suppose first that l x > and consider the candidate 

7o = {x = 0,|/<0}U5 

where S is a spiral trajectory lying in fast subsystem unstable manifold of (xi,x 2 ) = (0,0) 
at y = 0; this concludes the first part of the proof. If l\ < then the fast subsystem Hopf 
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fa) fold bifurcation 




(b) Hopf bifurcation 



c) pitchfork bifurcation 





(d) transcritical bifurcation 




Figure 3: Fast subsystem bifurcation diagrams for four bifurcations that are critical transitions. 
Solid curves indicate stability, dashed curves instability For the Hopf bifurcation in (b) only 
the projection onto (xi, y) is shown. Double arrows indicate the flow of the fast subsystem. 



bifurcation is supercritical so that there are stable periodic orbits of amplitude y/y for y > 0. 
Suppose there exists a candidate 70 through (x, y) = (0, 0) that satisfies (C1)-(C3) with 7o(i/) = 
(0, 0). Note that by (CI) we must have that 7 (t ? _i, tj) is contained in {y < 0, x\ = = X2}. By 
(C2) we note that 7o(^, %+i) cannot be contained in the bifurcating family of periodic orbits or 
in {y > 0, x\ = = £2}. Since (0,0) = (xi,X2) is asymptotically stable as an equilibrium point 
for the fast subsystem we can conclude that (C2) can never be satisfied for any candidate. □ 

The fold and Hopf bifurcations are the only generic bifurcations occurring in one-parameter 
families of equilibrium points of flows [56]. Under additional assumptions on the structure of the 
equations (e.g. assuming symmetries) one also often considers the following two one-parameter 
bifurcations: 

x' = yx + ax 3 pitchfork bifurcation, 
x' = yx — x 2 transcritical bifurcation. 

The analysis of the pitchfork bifurcation is completely analogous to the Hopf bifurcation case. 
Indeed, recall that the Hopf normal form ([8]) can be transformed into polar coordinates (r, 9) G 
(M+,5 1 ) 

r' = yr + /ir 3 , 
6' = 1. 

Proposition 2.7. The pitchfork bifurcation is a critical transition when it is subcritical (a > 0) 
and it is not a critical transition if it is supercritical (a < 0). 

The transcritical bifurcation case is slightly more interesting. 
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Proposition 2.8. The transcritical bifurcation is a critical transition. 



Proof. We can again work with the normal form. Consider the candidate 



7o = {x = 0,?/<0}U{x<0,2/ = 0} 



which shows that we have a critical transition. 



□ 



Note carefully that there is a candidate trajectory for the transcritical case that has (0, 0) 
as a transition point but that does not satisfy (CI). In particular, not all fast segments escape 
at a critical transition. Furthermore, the assumption that g ^ near the origin does suffice to 
make the transcritical bifurcation a critical transition. The fast-slow structure naturally sug- 
gests additional quantitative measures for critical transitions; see e.g. Definitions 12.91 and 13.11 
below. Furthermore our definition can easily be extended to any possible bifurcation scenario 
in a fast-slow system including higher codimension bifurcations and global bifurcations e.g. the 
lists of bifurcations in [83] can be subsumed into a fast-slow systems framework. 

Critical slowing down is an indicator of how far we are away from a critical transition point 
[86J. Recall that if we have a stable solution point X = X(t) for ([T]) and want to consider the 
evolution of perturbations X + u for ||w|| sufficiently small then 



which is the usual variational equation [5B]. It can be used to describe how quickly a pertur- 
bation of an asymptotically stable equilibrium point will decay to zero. 

Definition 2.9. For (x,y) in the attracting sheet C a of the critical manifold, perturbations 
z = (x + u,y) decay to (x,y) at an exponential rate exp(A u ). Note that X u < is negative; it 
is called the Lyapunov exponent of z. The largest Lyapunov exponent has smallest magnitude 
and is called the leading Lyapunov exponent. If the leading Lyapunov exponent of (x,y) G C a 
is 0(y a ) then we suggest to call a the recovery exponent. 

The exponent a provides a measure how quickly perturbations in the fast direction will decay 
near a bifurcation depending on the distance in parameter space to the critical transition. A 
larger a indicates slower decay. The recovery exponent can easily be calculated for the four 
bifurcations discussed above. 

Proposition 2.10. The recovery exponent a is given by 



Proof. Center manifold theory [21] implies that it suffices to consider the vector field on the 
center manifold to compute the leading Lyapunov exponent for an asymptotically stable equi- 
librium point near a bifurcation. For the fold bifurcation we find that (Q is given by 




(9) 




| fold bifurcation, 

1 Hopf, pitchfork and transcritical bifurcation. 
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Therefore H = 



—2\[—Y = 0(Y l l 2 ). For the pitchfork bifurcation one gets 



u 



_d_ 

dX 



(YX + X s 



u 



Yu. 



x=o 



The calculations for Hopf and transcritical bifurcations are equally easy. 



□ 



Although Proposition 12.101 is almost entirely obvious from a mathematical perspective it is 
often ignored in applications. In particular, we point out that different bifurcations can also 
lead to different quantitative slowing down effects. This idea is detailed for all bifurcations up 
to codimension two in 152 1. 



3 Fast-Slow Systems II: Dynamic Bifurcation 

In the previous section we have seen that fast-slow systems provide a structural view on critical 
transitions. The slow change of the parameter drives the system toward a fast subsystem 
bifurcation at which a rapid transition occurs. We suggest to quantify the fast-slow critical 
transitions further. 

Definition 3.1. Let 7q denote the first fast segment of a candidate trajectory starting at a 
critical transition point p. Let ui c {p) denote the w-limit set of 7q under the fast flow. Define 

r{p) := inf {d(p,u c (p))}, 
l*(p) := sup {d(p, uJ c (p))} ■ 

Basically P(p) is the distance to the closest fast subsystem attractor we can jump to by 
starting at a critical transition while I s (p) measures the distance to the most distant attractor. 
In Example (12.31) we have P(0, 0) = oo = l s (0, 0); the same holds for normal forms of subcritical 
Hopf and pitchfork bifurcations. However, it is interesting to note that for the transcritical 
bifurcation we have l % = and I s = oo. We can use l hS (p) to quantify what we described in 
Section [1] as "jumping to a far-away attractor". 

The theory of fast-slow systems provides a description of the flow near a fold critical tran- 
sition for the full system with < e < 1. We briefly review this result here; see also [251 SB] 
for further details. Consider the planar fast-slow system given by 

/ 2 

x = —y — x , 

y' = -e. 

Decompose the critical manifold as C = C a U {(0, 0)} U C r where 

C a = Cn{x>0} and C r = Cn{x <0}. 

For p > sufficiently small and a suitable interval Jcl, define a section A m = {(x, —p 2 ) '■ 
x G J} transverse to C a and define a section A out = {(— p, y) :|/6l} transverse to the fast 
subsystems. The next result describes the resulting flow map between the two sections [48J. 
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Theorem 3.2. Near a generic fold bifurcation of the fast subsystem the extension of C" under 
the flow passes through A out at a point (— p, 0(e 2 / 3 )). Furthermore, the transition map from 
A m to A out is a contraction with contraction rate 0{e~ K ^). 

One way to think of Theorem 13.21 is that the trajectory of the full system does not jump 
at the exact fold bifurcation point but is shifted or delayed in the slow direction by 0(e 2//3 ); 
see Figure [2](b) . For Hopf, pitchfork and transcritical bifurcations we also observe bifurcation 
delay. We shall briefly review the results for the delayed Hopf bifurcation in the simplest case; 
for details see [67J EH [69] . Consider a fast-slow system 

x> = f(x,y), 
y = e, 

with (x,y) G M 2+1 . Suppose the fast subsystem has a generic Hopf critical transition at y — 0. 
Suppose for simplicity that 

c = { Xl = = x 2 } = C a U {(0, 0, 0)} u c r 

where C a is attracting for y < and repelling for y > 0. Denote the complex conjugate pair 
of eigenvalues of (D x f)(0, 0, y) by \\^{y)- Consider a trajectory of the full system that enters 
an 0(e)-neighborhood of S a at y a and leaves an 0(e)-neighborhood at y r ; see Figure HJ The 
complex phase is defined as 

*(r)= [ T X 1 (s)ds 
Jo 

and the way-in/ way-out map n that maps a time r < to a time n(r) > is 

He[*(r)]=He[*(n(T))]. (11) 

In principle, a few additional technical assumptions are needed on the smoothness of / and 
the structure of the complex time level sets {r G C : Re[\l/(n(r))] = k} C C. Unfortunately 
these are lengthy to state in full generality (see [67J) but the following theorem provides the 
basic idea for most cases of practical interest. 

Theorem 3.3. For < e < 1 a solution j(r) = 7(et) of ( FlOl) approaching C a near y a at time 
t will be delayed and track the unstable branch C r . The map (ITTj) can be used to approximate 
the delay time II (to) and hence to approximate y r ~ n(ro) + y a - 

Further details about bifurcation delay and intricate special assumptions of Theorem 13.31 
can be found in [671 IS]- Pitchfork and transcritical transitions are treated in [49] . The delay 
effect shifts the critical transition in a slowly varying parameter space from y — to y ~ y r > 0. 
For y < we can predict that a critical transition is going to occur by critical slowing down; 
alternatively, for y > we could also observe perturbations growing exponentially. In particular, 
if we know e, y a and the type of the critical transition then we can use the theory of bifurcation 
delay (or dynamic bifurcation) to predict y r accurately. Let us point out that it is a key new 
observation that the delay effect has to be incorporated in the prediction of critical transitions 
and that it can potentially be useful to find early-warning signs. 
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Figure 4: Simulation of (JS} with the appended equation y' = e; l\ — 1, e = 0.01 and the 
initial point is (xi(0), £2(0), 2/(0)) = (0.3,0.3,-0.5). The trajectory 7 approaches C, enters a 
neighborhood [/ = {\x\\ < e} (dotted lines) at y a , follows the attracting part of the critical 
manifold exponentially closely, experiences a delay near a subcritical Hopf bifurcation of the 
fast subsystem and then leaves U at y r . 

4 Stochastic Dynamical Systems 

The next step is to incorporate stochastic effects to capture the role of noise in critical transi- 
tions. We start by reviewing several different viewpoints in the theory of stochastic dynamical 
systems. Currently the theory is less complete and structured than deterministic ODE theory. 
Therefore our presentation is necessarily less complete in comparison to deterministic fast-slow 
theory and just highlights some important ideas and methods. 

Fix a probability space (O, J 7 , P) and consider a general Ito stochastic differential equation 
(SDE) 

dzt = A(z t , t)dt + B(z t , t)dW t (12) 

where z e R N , A : R N xR4 R N , B is an I x iV-matrix and W t = (W 1)t , . . . , W N , t ) T is standard 
Brownian motion with components defined on (fl,J-, P); we always assume that the initial 
conditions are deterministic and that A and B are sufficiently smooth maps so that existence 
and uniqueness results for SDEs hold |70j . Alternatively we could also consider a Stratonovich 
SDE 

dz t = A(zt, t)dt + B(z t , t) o dW t (13) 

which can, of course, be converted to an Ito SDE and vice versa [31]. Nevertheless, it is an 
important modeling question what formulation one chooses [10] . There are several complemen- 
tary viewpoints to analyze (jT2|) - (fl3|) that are all envisioned to be helpful in the understanding 
of critical transitions. 
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Sample paths: The map u> h-> z t (u>) describes a sample path for a given randomness/noise 
u £ Q. Analyzing sample paths most closely resembles the study of ODEs as one still 
deals with trajectories. 

Transition probability: Denote the probability density of z t starting at zq at time t by 
p(z,t) = p(z,t\z ,t ) associated to ffT2l . Then p satisfies the forward Kolmogorov or 
Fokker-Planck equation 



d N d 1 N d 2 

-p(z,t) = -J2 w (Mz,t) P (z,t)) + ^^(MMM^)) (14) 

j=l 3 j,k=l 

where bjk are elements of the diffusion matrix BB T . We shall not use the associated 
backward Kolmogorov equation here that is defined via the adjoint of the right-hand side 
of (|T4|) in the variables (zo,fo)- Via the forward and backward Kolmogorov equations we 
can use the theory of parabolic partial differential equations to understand the SDE (TT2]) . 



Random Dynamical System: Under suitable conditions any SDE generates a random 
dynamical system given by a skew-product flow 

(u, z) h-> (6(t)u, tp(t, w)z) =: 9(f) (w, z) (15) 

on Q x M N ; for details of this construction see [61 H]. The key point of ( !T5|) is that it 
provides a convenient framework to analyze invariant measures. Let B denote the Borel 
a- algebra on Mr. Then a measure /ionfix M N is an invariant measure on (Q x X, J 7 x B) 
if 9/i = n and 7TQ/i = P where 7Tq is the projection onto Q. 



Detailed introductions to some aspects of stochastic dynamics can be found in [HI El 
As a first step we use the sample path approach [TS], UB] and point out what it provides in the 
context of critical transitions. Let z = (x, y) G Mr and consider the fast-slow SDE 

dx T = ~f(x T ,y T )dr + ^dW T , 
dy T = g(x T ,y T )dT + o- g dW TJ 

where the noise level (cr| + cr 2 ) 1 ^ 2 = a = a(e) is usually assumed to depend on e; the scaling of 
the fast equation aj/y/e has been chosen due to the scaling law for Brownian motion (recall: 
W\ T = \ l l 2 W T in distribution for A > 0, [27]). To understand critical transitions we would like 
to distinguish the region of ?/-values close to the transition from those far away. Let us first 
analyze the situation away from a critical transition where the deterministic critical manifold 
C is normally hyperbolic and attracting. The deterministic slow manifold is given by 

C e = {(x,y)eM 2 :x = h e (y)} 

where h t (y) = h (y) + 0(e) by Fenichel's Theorem. 

Remark: Here we follow Berglund and Gentz [18] but point out that alternative approaches 
for fast-slow SDEs are considered in |79j using random dynamical systems and in [H] using 
moment estimates and asymptotics. 
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The first goal is an estimate on the concentration of solutions to (|T6|) near the deterministic 
slow manifold. To identify a neighborhood containing most sample paths we define the process 



£ T : = Xt - h t (y T ). 



(17) 



Observe that £ T measures the deviation of the fast components from the deterministic slow 
manifold. Applying Ito's formula to ( fTTl) gives: 

d^ T = dx T — (D y h e )(y T )dy + 0{a 2 )dr (18) 

1 [f(h £ (y T ) + £ T , y T ) - e{D y h e ){y T )g{h e {y T ) + £ T , y T ) + 0{ea 2 )} dr 



e 
+ 



dW T . 



Considering the linear approximation of (fT8~j) in £ r , neglecting the higher-order Ito term 0(ea 2 
and replacing y T by its deterministic version yf et gives 

< = iMy^Cdr + [% - a g (D y h e ) (y**) 
dy d T et = g(K(y d T et ),y d T et )dr, 



dW T1 



(19) 



where A e is defined as 

A e (y) = (D x f)(h e (y),y)-e(D y h 6 )(y)(D x g)(h e (y),y). 
Then define X T := crj 2 Var(^°) which satisfies a fast-slow ODE [18] given by 

eX = 2A e (y)X + l, 

v = g(he(y),y)- 

The slow manifold of fl20l) is 

0(6) 



(20) 



C* = ^(X,y)eR 2 :x = H e (y) 
The neighborhood of C e is then defined as 

N(r;C e ) :=((x,y)E 



2A e (y) 



5 2 . (^-/^e(y)) 2 <r 2 



(21) 



Essentially this provides a strip around C e with width depending on the variance and the 
linearization of the SDE; see Figure |5] for an illustration. A detailed statement and proof of the 
next theorem can be found in |18|. 



Theorem 4.1. Sample paths starting on C t stay in N(r;C e ) with high probability for times 
approximately given by O (ee r ^ 2a ^ 



Theorem 14.11 is reminiscent of the classical Kramer's time to escape from a potential well 
[29|. From the methodology we have just reviewed, we can obtain several important conclusions 
for critical transitions: 
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Normal hyperbolicity provides the separation criterion for sample paths into two major 
regimes. To develop an early-warning sign we expect to pass through a normally hyper- 
bolic metastable regime before entering a region near the critical transition. 

Sample paths are likely to stay inside a neighborhood that scales with the variance. Hence 
if there is a critical transition due to the loss of normal hyperbolicity of a slow manifold 
we expect the variance to increase as we approach the transition; see also 



Regarding the previous point, it is easily seen from the techniques in [18] that this variance 
increase can be established rigorously for normal forms of bifurcations under suitable 
boundedness assumptions on the noise. In particular, there is no need to refer to heuristic 
arguments or autoregressive models as one can directly prove this result pathwise. 



The noise in the slow variable is of higher-order in the drift term of ([18]) . In the diffusion 
term we have noise contributions crf/y/e and a g so that if <jj and a g have the same 
asymptotic dependence on e we can again neglect the slow variable noise; hence we shall 
only consider the case a g = from now on. 



Motivated by the previous discussion and Fenichel's Normal Form Theorem 12 .21 we are going 
to model paths away from a critical transition by the system 

dx T = ^{-x)dT+fdW T) 

dy T = ldr, 

which decouples with y T = y + r and where < a = 0(1). The fast equation of (|22|) is just 
the classical Ornstein-Uhlenbeck (OU) process [31]. The solution starting at r = is 

Xt = Xo e- aT/e + [ e~ a ^- p ^ e dW p . (23) 
V e Jo 

Since our initial condition is always assumed to be deterministic we get that x T is a Gaussian 
process with mean and variance given by 

E[x T ) = x e~ aT/e , 

The correlation is easily computed as 

E[x T x s ] = (-—) e- a ^ T+s ^ + ^ e ~^s\l\ 
1 1 \ 2a J 2a 

Observe that on a slow time scale r of order 0(1) the terms involving e~ Kr ^ are extremely 
small. In the limit r — > oo we have the stationary variance given by 

2 

lim Var(a; T ) = — . (24) 
-r^oo 2a 

It is crucial to note that the variance is constant in the limit r — > oo but is already approx- 
imately constant up to exponentially small terms after a slow time r = 0(1). Therefore we 



16 



0.04 



dN 




ON 

-0.04 ' 1 1 1 

-2-1012 

y 



Figure 5: Simulation of f[2"2"j) with e = 0.02, a = 0.1 and a = 1. A sample path is shown 
(black) that stays inside the neighborhood N(r; C e ) = N with boundaries indicated by dN 
(dashed blue). We also plot a neighborhood defined by the variance a 2 (dashed red) and the 
slow/critical manifold C e (gray). 

expect that systems far away from critical transitions are characterized by a variance without 
a significant trend. The expectation and autocorrelation vanish in the limit r — > oo and also 
all other moments are constants. 

Furthermore our normal form approach also suggests a way how to estimate the parameters 
from a single sample path. First we detrend the fast variable data in a sufficiently long normally 
hyperbolic phase. Using the model (122]) for the fast dynamics gives 

dx T = —ax dr + a dW T (25) 

where a := a/e and a = cr/y/e. Well-known statistical techniques for parameter estimation [82J 
can then be applied to ( 1251) to find a and a from the detrended data; for example, by using a 
maximum likelihood estimator or many other possible estimators [55J. This provides the correct 
order of magnitude for e from a since a = 0(1) by assumption. Then we can conclude the order 
of a from a. This shows that the initial data far away from a critical transition can have crucial 
value for its prediction. Note however, that we have assumed that detrending transforms the 
system into Fenichel normal form. The following example shows the problems that can result 
in this context. 

Example 4.2. We have assumed that the fast-slow SDE (T22]) is already in Fenichel normal 
form. In general, we only have the equation for the deterministic critical manifold C — {(x, y) G 
1R 2 : f(x, y) = 0}. We can describe C as a graph h : IR — > M so that 

C = {(x,y)eR 2 :x = h(y)} 



17 



where f(h(y),y) = 0. Then the coordinate change X = x — h(y) gives that in the new 
coordinates C = {X = 0}. Let us consider the following example 



dx t 
dyt 



(y - x)dt + adWi 
eg(x,y)dt. 



(26) 



We set (X t , Y t ) = (x t — y t , yt) which transforms fTSBI) to 



dX i 
dY t 



-x - eg(X + Y, Y)dt + adW t 
eg(X + Y, Y)dt. 



Then the variance of Xt and X t are equal since 



Var(Xt) = Var(x 4 - y t ) = Var(xt) + Var(y t ) - 2Cov(x t , y t ) = Var(x t ) 



since yt was assumed to be deterministic. In general, this cannot be assumed so that stochastic 
slow variables definitely will change the result; see also equation (|T8|) . 

Hence we have identified the problem of coordinate transformation effects on critical transi- 
tion indicators as a topic for future study. We are not going to consider this problem here but 
point out it arises immediately as a key problem once a mathematical framework for critical 
transitions is considered. Even without the parameter estimation problem in a normally hyper- 
bolic regime away from the transition we must consider this problem; indeed, we might want 
to assume for theoretical analysis that systems are in normal form near the critical transition 
point. 

5 Stochastic Indicators 

A natural question for finding indicators of critical transitions for SDEs is to ask what happens 
to the deterministic fold, Hopf, transcritical and pitchfork bifurcations under the influence 
of noise. This question already raises a few unanswered mathematical problems of stochastic 
bifurcation theory [HIE]. We briefly review two viewpoints about what a "stochastic bifurcation" 
should be. Suppose we are given a family of random dynamical systems (RDS) {O^} for a 
parameter j/6R associated to the SDE (fl2"l) or (|T3|) . Assume that is a family of invariant 
measures for the RDS which can be viewed as analogs for invariant sets in the deterministic 
case; for example, if the family of RDS has an equilibrium point at z = then fi y = So is a 
natural example. We say y = yu is a dynamical or D-bifurcation point if in each neighborhood 
of yu there is a family of invariant measures v y such that v y ^ jj, y and u y — > \i y as y — > y D in 
the topology of weak convergence. Basically this notion presented in |6] tries to capture the 
deterministic viewpoint of bifurcations in a stochastic context. Instead of "qualitative changes" 
for invariant measures one could also look at "qualitative changes" for densities associated to 
the SDE f|T2l . Suppose p y s (z) = p y s is a family of probability densities solving the stationary 
Fokker-Planck equation 




(27) 
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It has been suggested to consider a qualitative change in the family of densities apf a bi- 
furcation point |40j . For example, if the density p y s is unimodal for y < yp and bimodal for 
y > yp then y — yp is called a phenomenological or P-bifurcation point; substantial progress 
has been made to understand D- and P-bifurcations [TT| [T2] and associated problems of random 
attractors [221 EE] and stochastic normal forms [6] but this theory has not yet been applied to 
detecting critical transitions. 

We are going to consider an example by Arnold and Boxler [TJ E] where explicit calcula- 
tions for D- and P-bifurcations are possible; this will demonstrate that the stochastic bifurca- 
tion concepts can complement existing techniques to predict critical transitions. Consider the 
parametrized family of Stratonovich SDEs 

dx t = {yx t - x\)dt + ox t o dW t (28) 

representing one possible interpretation of a transcritical bifurcation with noise. Note that we 
could also make the parameter y slowly varying; since we are working on the fast time scale t 
this would amount to using the deterministic equation 

dy t = edt. 

However, the parametric analysis is already very complicated and we shall restrict to this 
situation here. The Ito SDE associated to (1281) is 

dx t = (^yxt — x 2 t + -a 2 x^j dt + ax t dW t . (29) 

Note that we are dealing with multiplicative noise with respect to the trivial solution xt = 0. 
An explicit formula [6] for the random dynamical system defined by (1281) is 

xe yt+crWt (w) 

U)Jt,0j)x = 7 . (30) 

Ergodic invariant measures fi for RDS on M. are always random Dirac measures i.e. of the form 
8x (u}) [S]. From formula ( 130]) it follows that there are two families of ergodic invariant measures, 
one supported at given by /i^ = Sq and one family = S x »^ supported on the random point 
that makes the denominator in fl30l) zero as t — > ±oo: 



-(/ °° ey^^dty 1 for 2 /<0, 
X * y{Uj) = ^ ( e«+° w *Mdt) ~ l for y > 0. 



It is very important to note that for y ^ the random dynamical system (13"U|) is only defined 
for t > on the random domain given by 



yy ' \ [0, oo) for y > 0. 



One idea explored further in Sections [6] and [7| is to analyze the role of this random boundary 
and how it signals the explosion/critical transition of the process. Having explicit expressions 
for the ergodic invariant measures one can show the following bifurcation theorem [7j. 
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Theorem 5.1. The SDE (|28|) modeling a transcritical bifurcation with multiplicative noise has 
a D-bifurcation at y = 0. 



The D-bifurcation point provides us with an analog of the deterministic transcritical bi- 
furcation point. We know that a critical transition is induced by a deterministic transcritical 
bifurcation at y = 0. The stochastic formulation (1281) also provides us with additional informa- 
tion. Consider the stationary Fokker-Planck equation associated to (T28]) - (T29|) 



d ( (, a 2 , 2 \ A d 2 ( ' a 2 x 2 



= -E U [s + ^ ~ x ) P ° W ) + M {—^ x) ) (31) 

where p v s {x) denotes the stationary probability density of p y (x,t). One normalizable solution 
of ( l3Tj) for y > is given by 

P y s (x) = ^^~ le_!l ( 32 ) 

for x > and p y s {x) = for x < 0; here N y is a computable normalization constant [6]. From 
( I32p we see that the density has a singularity at x = for y G (0,a 2 /2) and is unimodal for 
y > a 2 jl. Hence there is a P-bifurcation at yp = cr 2 /2; see also [9T1 90J to make the non- 
equivalence of the two densities precise. We can either use the backward Kolmogorov equation 
or a symmetry argument to obtain another P-bifurcation at y — —a 2 /2 giving the final bifur- 
cation diagram shown in Figure [6j 




Figure 6: Bifurcation diagram for the Arnold and Boxler example ( 12 8p with a = V0.8. There 
are two P-bifurcations at yp± = ±cr 2 /2 = ±0.4 and a D-bifurcation at yp> = 0. The stationary 
densities are plotted at the values y = ±0.8 and y = ±0.2 to show the qualitative change for 
the P-bifurcation. The deterministic transcritical bifurcation diagram is drawn for orientation 
purposes. 

It is very interesting to calculate some of the moments of p y s (x) explicitly; we fix y > and 
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consider ( 132|) . For the mean m s (y) we find 




The variance v y s is 



8 
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Figure 7: Parameter-dependent variance v v s for the Arnold and Boxler example fl28|) . The 
formula is given in equation f[55|) . 

A direct plot in Figure [7] shows that the variance is non- monotone for sufficiently small noise 
a. In particular, there is a local minimum and a local maximum for y > yp. By symmetry this 
situation also holds for y < yp. There are several observations that we can conclude from the 
previous discussion regarding critical transitions: 

• There is a P-bifurcation preceding a D-bifurcation for the transcritical bifurcation oc- 
curring in (1251) . In particular, the P-bifurcation point can potentially be used as an 
estimator/predictor for the critical transition point. 

• The D-bifurcation point could be used to form the "organizing center" for the critical 
transition in analogy to the bifurcation point in the deterministic case i.e. it provides us 
with a rigorous definition of a reference point where the jumps occur. 

• The unstable deterministic equilibrium branches naturally appear as boundary points for 
the stationary Fokker-Planck equation. 

• The variance, and also other moments, can vary rapidly and non-monotonically near a 
critical transition point; cf. the situation in Section [71 
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• For the non- stationary case, the boundaries for the dynamical system have to be random 
since there is always a positive probability that a sample path reaches any positive or 
negative x-value. We shall discuss this problem in Section El 

We remark that the example by Arnold and Boxler is rather special since we were able to 
find explicit solutions for all interesting quantities. In many cases we would have to rely more 
on numerical methods; see, for example, [621 EZ] • Furthermore, it has been shown that D- and 
P-bifurcations do not always have to appear together and that the situation for Hopf bifurcation 
is much more complicated than anticipated [5]. However, examples with multiplicative noise 
are expected to appear naturally in many control problems since approaching an instability 
also might want to reduce the noise level. In this scenario it is easy to understand that for 
multiplicative noise a rising variance early-warning sign can fail [52]. Therefore we suggest that 
P-bifurcation indicators should definitely be added to the toolbox of possible early-warning 
signs. 



6 Noise-Induced Transitions 

The term "noise-induced transitions" groups together a rather wide spectrum of phenomena; 
other terms that are related to it are stochastic resonance, coherence resonance, self-induced 
stochastic resonance [5B]. The different concepts share a common feature: the noise induces 
dynamical behavior in a system that cannot be found in the deterministic version. To illustrate 
the situation consider the following planar fast-slow SDE 



dx T = \{y- x 2 )dr + ^dW T 
dy T = g(x T ,y T )dr, 



(34) 



modeling the fold critical transition. If we consider f l34jl on the fast time scale t = r/e and then 
consider the singular limit e — > we get 

dx t = (y-x 2 )dt + adW t . (35) 

Fixing some y > a sample path starting for some x ~ ^Jy is expected to stay with high proba- 
bility near the stable equilibrium of the deterministic system at x = ^/y if o is sufficiently small; 
see Theorem 14.11 The problem is that it can escape from a neighborhood of ( 1351) eventually 
with some probability i.e. there is a large deviation. Classical theory of large deviations [29] 
predicts how likely it is to escape from an attracting equilibrium. The deterministic version of 
f |35|) is a gradient system with potential 

U(x) = —yx + -x 3 . 

The potential difference to go from the stable equilibrium x = ^Jy past the unstable equilibrium 
at x = —y/y is 

H := UUy) - U{-^y) = 

Then it is a classical result in large deviations [291 [T8] that it takes a time t = 0(e 2H ^ 2 ) 
for an excursion past the unstable equilibrium to occur. If y = 0(1) and < a 1 then 
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these excursions are extremely rare and one expects that the fast-slow system (l3"lj) behaves 
deterministically and that Theorem (I3.2p applies to analyze the critical transition. The key 
point for this line of reasoning is that we have assumed that 

< a < y/e < 1 (36) 

for equation ( 135|) i.e. that the noise is small with respect to the time scale separation. In fact, 
one can show that excursions are very likely if the roles in ( 136]) are reversed |18j . 

Theorem 6.1. Consider the SDE ( 1341) and suppose g = 1. If a <C y/e then critical transitions 
before the deterministic fold bifurcation point occur with very small probability. For o 3> y/e 
critical transitions before the deterministic fold bifurcation occur with very high probability. 



The detailed estimates and the derivation of the scaling law can be found in [18]. Theorem 
16. II confirms our intuition that noise larger than the time scale separation can make the system 
jump away from an attracting critical manifold and that a fast-slow system with very small 
noise should closely resemble the deterministic situation. We also say that 

a « y/e 

marks the intermediate regime. Similar results should also hold for transcritical and pitchfork 
bifurcations but with a different scaling law. The situation is less studied but the results in [18] 
indicate that 

a « e 3/4 (37) 

is the intermediate regimes for the transcritical and pitchfork bifurcations. An additional prob- 
lem arises when the slow variables representing the parameters have non-trivial slow dynamics. 
Consider the following stochastic van der Pol equation (see also [58]): 



dx T = \ [y T - ?f + x T j dr + ^= e dW T , 
dy T = (a — x T )dr. 



(3? 



For a > 1 the deterministic equation has a unique globally stable equilibrium at x = a. The 
deterministic critical manifold is 

C=^(x,y)eR 2 :y = ^--x 

It is normally hyperbolic away from the two fold points x = ±1 and naturally splits into three 
parts 

c a <- =cn{x< -i}, c r = cn{-i<x <i}, c a ' + = {x>i} 

where C a,± are attracting and C r is repelling. In Figure [8] a direct numerical simulation using 
the Euler-Maruyama method for SDEs [38] is shown. 

Observe from Figure [S] that the sample path is not even close to the deterministic solu- 
tion which converges to the deterministic equilibrium at x — 1.05. A noise induced transition 
has occurred near the deterministic equilibrium point close to the fold point at x = 1. This 
transition induced a sample path that resembles classical relaxation oscillations; for an asymp- 
totic analysis of scaling laws in the double limit (e, a) — > (0, 0) we refer to [64l [65] . From the 
discussion in this section we can conclude the following for critical transitions: 
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Figure 8: Single sample path (black) for equation (I38I) with parameter values (e, a, a) = 
(0.05,1.05,0.1). The critical manifold C (grey) also shown. The path was started at 
(x(0),y{0)) = (2,2/3) and has been stopped at r = 2400. 

• Critical transitions are expected to occur before reaching the neighborhood of a deter- 
ministic bifurcation point if the noise level is larger than the time scale separation. 

• If the noise is small compared to the time scale separation (e.g. o <C y/e in the fold 
transition) we expect the deterministic bifurcation point to be a good prediction for the 
location of the critical transition. 

• Scaling laws between noise and time scale separation will play a crucial role whether 
critical transitions are predictable at all and what phenomena can occur as we approach 



• A slow variable/parameter with non-trivial dynamics can cause very complicated noise- 
induced transitions if g is not bounded away from zero near the bifurcation point. The 
situation is even more complicated once multiple slow variables are considered [52], EU] . 

7 Variance I: Analysis 

In this section we calculate the variance before a critical transition for several bifurcations in 
the singular limit. We consider the fast-slow SDE 



for (x,y) G M 2 and a > is constant. The function f(x,y) will be the vector field for the 
normal forms of the fold, transcritical and pitchfork bifurcations. Since we are only interested 
in the moments before the transition, we consider the normal forms only for y < as given in 
Section |2j In the singular limit e — > 0, the fast subsystem is one-dimensional with transition 



a transition 




dx t 
dyt 



f(x t ,y t )dt + adW t 
edt, 



t- 



(39) 
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probabilities p y (x,t) = p y (x,t\xo,to) satisfying the Fokker-Planck equation 

= -^(f(x,y)pv(x,t)) + y|^M) ( 4 °) 

posed on some interval (a, b) C K with initial condition p y (x, to\xo, to) = — %o)- The 
probability current J is defined by 

2 rj 

J(x,t) = f(x,t)P(x,t) - p»(x,t). 

Let us assume that there is a stationary distribution p y = p y (x) for the process then (1401) 
reduces to 

^(/(s, - y j^&ix) = (41) 

which means that J = J(x) satisfies J'(x) = and hence J(x) = constant; if we add reflecting 
boundary conditions then J = and it follows that 

f( x , y ) p y( x )-?-—py( x ) = o. (42) 

The last equation can be integrated directly to give the classical potential solution 

Vs\ x ) — T7 ex P ( ^ / 2 — ® w 



a* 



where M is the normalization constant for the probability distribution M = f p v s (x)ds. For 
each of the normal forms we choose the boundary points as follows: 

fold / = fi(x,y) = -y - x 2 (a, b) = {-yf 1 }}, oo), 

transcritical / = f2(x, y) = yx — x 2 (a, b) = (y, oo), (43) 

pitchfork / = f 3 (x, y) = yx + x 3 (a, b) = (—/=!/, V~y)- 

The choices are motivated by two factors. In Section we observed that the random 
dynamical system induced by the SDE ( 13"9|) is described by limiting its domain to points which 
do not escape. We eliminate the random boundaries and consider the unstable equilibria (i.e. 
the repelling parts of the critical manifold) as boundaries. Furthermore, our choice of reflecting 
boundaries enforces the condition that transitions only occur after the deterministic critical 
transition. We get the following stationary densities 

fold pl x {x) = £ exp (£ [-yx - |x 3 + l{-yf' 2 ]) , 

transcritical P% 2 ( x ) = j% ex P (Jr [\yx 2 - \x 3 - \y 3 ] ) , (44) 

pitchfork P%{x) = ^ exp [\yx 2 + \x A + \y 2 ]) . 

By comparing (|44p to the Gaussian density of ( l2"3"j) . we observe a transition from symmetric 
to asymmetric behavior for the fold and transcritical transitions. However, the density for the 
pitchfork transition is still Z 2 -symmetric with respect to x \- > —x. Furthermore there are no 
P-bifurcations for any p v s ■(#) for y < and j = 1,2,3. This shows that symmetry-breaking 
and P-bifurcations are not necessarily early-warning signs of critical transitions. 
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Figure 9: Variances Var for flUj) depending on the parameter y with e = 0; transcritical (green), 
pitchfork (blue) and fold (red) transitions are considered. Starting from y <C — 1 the variance is 
almost constant, then we see that for all three cases there is a clearly visible rapid increase in the 
variance as the deterministic critical transition is approached. However, due to the reflecting 
boundary conditions we have chosen for the singular limit Fokker-Planck equation, the variance 
decreases again near y = 0. 

Figure [9] shows the variance of each distribution as a function of the parameter y for a 
given fixed noise a = 0.1. Starting the parameter from y ^ —1 and increasing it, we see that 
for all critical transitions there is a rapid increase in the variance as the deterministic critical 
transition is approached. This confirms the observations and predictions from Section [T] for 
our normal form SDE models. However, we also observe that there are local maxima for each 
curve as we increase y further. The local maxima are caused by our modeling approach using 
the reflecting boundaries; the density becomes more and more confined near the stable critical 
manifold as we approach y = 0. Note that this does not contradict results using a sample paths 
approach as for sample paths the scaling of the variance is calculated without boundaries at 
unstable equilibrium points and for e > 0. Another interesting conjecture from Figure [9] is that 
the additional local maxima that we have obtained using reflecting boundaries can be viewed as 
the locations where a linearized approximation fails. More precisely, when y -C — 1 then we are 
in a normally hyperbolic regime and linearization and results about OU-process are applicable. 
When we get closer to the critical transition, nonlinear effects and noise-induced phenomena 
have to be taken into account. Furthermore it is easily calculated from the formulas (|44j) that 
the local maxima of the variance move closer to the critical transition if we decrease the noise 
level. This shows that by choosing boundary conditions for the Fokker-Planck equation we 
not only guarantee the existence of a normalizable density in the singular limit but also obtain 
additional information about critical transitions by identifying an easily-to-calculate indicator 
("the local maximum") beyond which linearized theory definitely fails. This shows that a dy- 
namic sample paths viewpoint (e > 0) and a singular limit (or quasi-static, e = 0) approach to 
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critical transitions can nicely complement each other. 

From Figure Owe can also conclude that the variance curves for the transcritical/pitchfork 
transition are substantially different from the fold transition. Since the two cases also have 
different recovery exponents for slowing down (see Proposition 12.101) it should be possible to 
distinguish between them using early warning signs. 

8 Variance II: Numerical Simulation 

To relate our results in Section [7] more directly to techniques used in applications we consider 
numerical simulation of sample paths [3H1 113 E2] • As a first question we address what happens 
to the variance for < e 1 in comparison to the singular limit calculation Fokker-Planck 
calculation. 
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y 

Figure 10: Variances Var depending on the parameter y; transcritical (green), pitchfork (blue) 
and fold (red) transitions are taken from Figure [9] with a = 0.1. The black curves have been 
computed from 1000 sample paths with (a, e) = (0.1, 0.02). A path beyond the unstable critical 
manifold at some y = y c (see boundaries in equation (H3j) ) is counted as an escaped path and 
is not considered for the variance with y > y c ; note that the colored curves from Figure [9] 
have been computed with reflecting boundaries and e = 0. The figures (al),(bl),(cl) show the 
variance and (a2),(b2),(c2) the percentage of escaped trajectories for the fold, transcritical and 
pitchfork transitions respectively. 

Again we consider the fast-slow SDE (1391) for the fold, transcritical and pitchfork normal 
forms given inH3j Figure ITOl shows the variance of the x- variable, for each value of y, calculated 
from 1000 sample paths. More precisely, if we index the sample paths by j = 1,2, ... , 1000 
we compute the variance of the fast variable {xl}j for a fixed time t; since y = et, we ex- 
pect to re-compute an approximation to the variance for the stationary distributions p v s (x) if 
e is sufficiently small, as long as we are not too close to y — where noise-induced transi- 
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tions and reflecting boundary effects are dominant. We have fixed the parameter values to 
(a, e) = (0.1,0.02) which means for the fold bifurcation we rarely expect noise-induced transi- 
tions. Due to the different scaling laws for the transcritical and pitchfork bifurcations, we do 
expect noise-induced transitions in this case; cf. [18] and equation (1371) . The percentage of 
escaped trajectories is shown in Figure [T07a2).(b2).(c2). The computed variance of the sample 
paths is shown in Figure [T0T al).(bl).(cl) as black curves. 
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Figure 11: Sample variances Var from depending on the parameter y; (a) fold bifurcation, 
(b) transcritical bifurcation and (c) pitchfork bifurcation. The black curves have been computed 
from 1000 sample paths with (a, e) = (0.1, 0.02). A path beyond the unstable critical manifold 
at some y = y c (see boundaries in equation fH3|) ) is counted as an escaped path and not 
considered for the variance with y > y c . 

Note that our initial prediction of variance increase from Section [7J is correct but our simple 
stationary distribution method fails to capture the results correctly very close to the transition 
point. This is expected as sample paths are counted as escaped path for the numerical simu- 
lation once they reach the boundaries defined in fH3|) ("absorbing boundaries", e > 0) whereas 
the Fokker-Planck calculation in Section [7J assumed reflecting boundary conditions and e = 0. 
This shows that due to the reflecting boundaries the variance is decreased near the transition 
point. The interesting conclusion from Figure [TT1 is that different modeling techniques can pro- 
duce different estimates for the moments in critical transition normal forms near the transition 
point. As long as we are far enough away in our approach the theories match up. This suggests 
to focus on this initial regime away from the bifurcation; this analysis is carried out in detail 
for all bifurcations up to codimension two in [52J. 

However, a major problem arises in a practical context, if we only have a single sample path 
to predict a critical transition, say j t = (x t ,yt) for t G [0,T]. Usually one computes an early- 
warning sign by considering a finite time interval (or window) of length s < T and computes 
the sample path variance for this time interval [75]. Suppose jt is known on a grid of times tj 
with to = and t/v-i = T so that iV* time points fall into an interval of time length s. Then 
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Figure 12: Sample path near a fold critical transition (black); parameters are (a, e) = (0.1, 0.02). 
The deterministic critical manifold C is shown in grey and two subsets are marked (dashed red) 
which correspond to windows of length y « 0.2861. From these two windows we compute two 
sample variances V{t\ 2 ) = V\p where et 1 « —0.7 and et\ « —0.02 according to fT4"5"l) . The mean 
values 2 ) = /ii^ are marked with red dots. The variance is indicated by a red vertical lines 
[fij — Vj,fij + Vj] for j = 1,2 that have been centered at the mean values and stretched by a 
factor of 20 to make the variances visible. 

the sample mean for the fast variable x for some t* G [s, T] is 

Hp) := - 8,f]) = ± E x *i 

tjE[t*—8,t*] 

and the sample variance is 

V(f) := Vor([f - M*D = ^ E - MP - Ml)} 2 • (45) 

Figure [TT1 shows the sample variance for (a, e) = (0.1,0.02). A window of size s ~ 14.3051 
is used which corresponds an interval of length w 0.2861 for y as y — et — 0.02t. For the 
transcritical and pitchfork bifurcations in Figure fTlTb)-(c) we obtain shifted versions of the 
stationary variances i.e. the variance increases but local maxima are moved towards the critical 
transition. This is expected since the sample variance "lags behind" the stationary estimator 
that is computed at a fixed y for < e < 1. 

The sample variance indicator for the fold transition in Figure [TTT a) shows a clear mono- 
tone increasing deterministic trend and does not seem to lag behind the stationary variance 
calculation/simulation. This can be explained easily from the fast-slow geometry of the SDE 
as follows. Consider a single sample path near the fold transition shown in Figure [T2l at param- 
eter values (cr, e) = (0.1,0.02). In Figure [12] two subsets of the deterministic critical manifold 
are marked (dashed red) which correspond to windows of length y w 0.2861. From these two 
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Figure 13: Plot of the lag-/c autocorrelation with k = 0.002 for 10000 sample paths for each of the 
three one-dimensional critical transitions: fold (red), transcritical (green) and pitchfork (blue). 
The solid thin lines are numerical data and the thick dashed lines are approximations (quadratic 
for the fold and linear for transcritical/pitchfork). Parameter values for the simulation are 
a = 0.1 and e = 0.02. 

windows we compute two sample variances V(t1 2 ) = V\p, where et\ ~ —0.7 and et\ « —0.02 
according to (145)) . The mean values /^(t 12 ) = A*i,2 are marked with red dots. The variance is 
indicated by a red vertical lines [fij — Vj,fij + Vj) for j = 1,2 that have been centered at the 
mean values and stretched by a factor of 20 to make the variances easier to visualize. It is 
now obvious why the variance must increase "deterministically" near fold critical transition if 
measured using ( )45|) : the critical manifold is locally parabolic and has much higher curvature 
near y = 0. Since the window size for the measurement has to be rather large to measure 
anything meaningful, the sample mean fi 2 is located further away from the critical manifold. 
Hence the sample variance will be larger due to geometric considerations and without even 
considering the noise effect. A good way to think about the situation is to project the subsets 
of the sample path corresponding to the two measurement windows onto the vertical red lines 
in Figure [121 The same argument does not hold for the transcritical and pitchfork bifurcations 
as the stable critical manifold before the transition is given by x = 0. This shows that practi- 
cal measurement techniques have to be applied and interpreted very carefully if only a single 
sample path is available. 

9 Autocorrelation 

Increasing autocorrelation has been proposed as an early warning sign for a critical transition 
[371 EHl [75] . As a first step we calculate the autocorrelation from numerical simulation averaged 
over 10000 sample paths for the normals forms of the fold, transcritical and pitchfork transitions; 
see equations (|39l) and (SSJ) • The lag-fc autocorrelation can be estimated from a time series 
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Figure 14: Plot of the lag-fc autocorrelation with k = 0.002 for 2 sample paths for the tran- 
scritical transition, (a) The thin lines are numerical data, (b) The thick lines are linear 
approximations of the curves in (a). Parameter values for the simulation are a = 0.1 and 
e = 0.02. 

(xi, X2, ■ ■ ■ , x n ) by the formula 



where \i and v are the sample mean and variance. We counted a sample path as an escaped path 
once it leaves the set {(x, y) G M 2 : x > — 1} for the fold and transcritical transitions; for the 
pitchfork transition we consider sample paths only inside the set {(x,y) G M 2 : \x\ < 1}. Figure 
[TBI shows the results for the lag-/c autocorrelation with a short lag of k = 0.002 computed from 
a subsegment of the sample path of length 8k. We do not discuss the different choices regard- 
ing the lag k or the choice of time series subsegments but remark that practical applications 
might have to deal with short time series data. There is a visible increasing trend in the auto- 
correlation for all three critical transition point. The autocorrelation for the transcritical and 
pitchfork transitions increases linearly and the two cases are virtually indistinguishable by this 
measure. The fold autocorrelation seems to increase quadratically. This shows that the increase 
in autocorrelation can be found in our SDE normal forms as an indicator for a critical transition. 

As for the variance, it is more problematic to interpret the autocorrelation as an indicator 
for a single sample path. The problem is demonstrated in Figure HH for two sample path 
approaching the transcritical transition. The autocorrelation fluctuates rapidly as y slowly 
increases; see Figure [141(a). As a first approach to check whether it is increasing or decreasing 
we consider a linear approximation as in Figure [131 These lines are shown in Figure [T4T b) and 
one increases (green) while the other decreases (black). We know that on average we expect an 
increasing autocorrelation but we would make an incorrect prediction from the black sample 
path. This demonstrates a need for a detailed analysis of the dependence of different indicators 
on the parameters. For example, for the autocorrelation we have the system parameters (e, a) 
and the measurement parameters (k, n) for the l&g-k autocorrelation of a time series of length 
n. 




n—k 
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10 Discussion 



In this paper we have given an overview of the mathematical tools that can be applied to 
critical transitions. Our main viewpoint is that studying normal-form fast-slow stochastic dy- 
namical systems should provide an additional route to understand critical transitions beyond 
studying models arising directly from applications. Standard methods from fast-slow systems 
have been used to formalize the definition of a critical transition. As the next step, different 
viewpoints from stochastic dynamics were reviewed and their contributions to the prediction 
of critical transitions was discussed. For example, we have pointed out that variance increase 
immediately follows from well-known results of sample paths analysis or that P-bifurcations 
could act as a novel prediction mechanism. Then we focused on the variance as an indicator in 
the setup of normal forms and used analytical, numerical and geometric ideas to understand the 
increasing variance near a critical transition. Throughout our analysis we highlighted several 
challenges that arise in the modeling process including noise types (additive/multiplicative), 
problems with single sample paths as well as scaling laws for noise-induced phenomena. 

We have not discussed further mechanisms and early warning signs that have been reported 
in applications: 

(a) The change in spatial structure of a dynamical system can often be used as an indicator 
for an upcoming transition EE] . One could hope that bifurcation theory for pattern 
formation is applicable in this case [73| BXj; in particular, reaction-diffusion PDEs might 
be the best starting point. The stochastic theory for SPDEs is much less developed [30] 
but statistical indicators are still expected to exist. 

(b) We have focused primarily on the one-dimensional critical transitions (fold, transcriti- 
cal, pitchfork). Although the pitchfork transition immediately gives results for the Hopf 
transition if the noise is only in the radial component, it does not capture its complete 
dynamics. The analysis of stochastic Hopf bifurcation is much more complicated than 
one-dimensional stochastic bifurcations jlSJ EJ [T7]. We expect that the general analysis 
can be particularly complicated by noise correlated between the two fast variables. 

(c) Global bifurcations can induce drastic shifts in dynamical systems {3D, 89J. In this respect, 
it becomes evident that we should also address critical transitions for iterated maps since 
they appear as Poincare maps for differential equations; for example, it is well-known that 
critical slowing down occurs near a period-doubling bifurcation [36J. 

(d) Chaotic systems might provide special indicators that could be examined |75j . The gener- 
ation of many chaotic attractors is preceded by well-analyzed bifurcation sequences [3U [2] . 
Therefore it is conceivable that one might be able to modify or extend existing methods 
to yield early warning signs. 

(e) Fast-slow systems with three or more dimensions have not been discussed here. One ex- 
ample are fold bifurcations with two slow variables and one fast variable [331 |8T] which 
occur generically on one-dimensional curves. Small oscillations can occur before a trajec- 
tory reaches a fold bifurcation and jumps to a far-away attractor. This behavior could be 
used as an indicator to predict a critical transition; a detailed review of the deterministic 
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case in the context of mixed- mode oscillation can be found in [24J. Stochastic folded 
nodes are discussed in [20J. 

(f) We have also not discussed the effect of noise on delay. The main point in this context is 
that small noise can reduce the deterministic delay effect discussed in Section [3j we refer 
the reader to [HU [15] and references therein for a more detailed discussion. However, let 
us note that it be very desirable to find early-warning signs before the delay-region i.e. 
calculating the precise jump time should be the second step of the mathematical analysis. 

We hope that the framework we reviewed and augmented in this paper also provides a bet- 
ter bridge between critical transitions in applications and the associated open mathematical 
challenges. It is expected that some new mathematical methods are going to be needed to 
address (a)-(f). Furthermore, we are fully aware that we have not maximized the results one 
can obtain from techniques presented here. For further results on normal forms, scaling of the 
variance and several applications see [52] . 
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of a critical transition point for deterministic fast-slow systems. Furthermore he provided many 
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anonymous referees for their helpful suggestions. 
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